Listing of Claims: 

1 . (Currently Amended) A speaker segmentation method for associating an at least one 
segment of speech for each of at least two sides of an at least one summed audio interaction, with 
one of the at least two sides of the interaction using additional information, the method 
comprising: 

a receiving step for receiving the at least one summed audio interaction from a capturing 
and logging unit; 

a segmentation step for associating the at least one segment with one side of the at least 
one interaction , the segmentation step comprising 

a parameterization step for transforming a speech signal into a set of feature 
vectors and dividing the set into non-overlapping segments; 

an anchoring step for locating an anchoring segment for each of the at least two 
sides of the interaction; and 

a modeling and classification step for associating at least one segment with each 
side of the interaction ; and 
a scoring step for assigning a score to said segmentation. 

2. (Original) The method of claim 1 wherein the additional information is at least one of the 
group consisting of: computer-telephony-integration information related to the at least one 
interaction; spotted words within the at least one interaction; data related to the at least one 
interaction; data related to a speaker thereof; external data related to the at least one interaction; 
or data related to at least one other interaction performed by a speaker of the at least one 
interaction. 

3. (Original) The method of claim 1 further comprising a model association step for scoring the 
at least one segment against an at least one statistical model of one side, and obtaining a model 
association score. 



4. (Original) The method of claim 1 wherein the scoring step uses discriminative information for 
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discriminating the at least two sides of the interaction. 

5. (Original) The method of claim 4 wherein the scoring step comprises a model association step 
for scoring the at least one segment against an at least one statistical model of one side, and 
obtaining a model association score. 

6. (Original) The method of claim 5 wherein the scoring step further comprises a normalization 
step for normalizing the at least one model score. 

7. (Original) The method of claim 4 wherein the scoring step comprises evaluating the 
association of the at least one segment with a side of the interaction using additional information. 

8. (Original) The method of claim 7 wherein the additional information is at least one of the 
group consisting of: computer-telephony-integration information related to the at least one 
interaction; spotted words within the at least one interaction; data related to the at least one 
interaction; data related to a speaker thereof; external data related to the at least one interaction; 
or data related to at least one other interaction performed by a speaker of the at least one 
interaction. 

9. (Original) The method of claim 1 wherein the scoring step comprises statistical scoring. 

10. (Original) The method of claim 1 further comprising:a step of comparing said score to a 
threshold; andrepeating the segmentation step and the scoring step if said score is below the 
threshold. 

1 1 . (Original) The method of claim 10 wherein the threshold is predetermined, or dynamic, or 
depends on: information associated with said at least one interaction, information associated with 
an at least one speaker thereof, or external information associated with the interaction. 

12. (Canceled). 
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13. (Original) The method of claim 12 wherein the anchoring step or the modeling and 
classification step comprise using additional data. 

14. (Original) The method of claim 13 wherein the additional data is one or more of the group 
consisting of: computer-telephony-integration information related to the at least one interaction; 
spotted words within the at least one interaction; data related to the at least one interaction; data 
related to a speaker thereof; external data related to the at least one interaction; or data related to 
at least one other interaction performed by a speaker of the at least one interaction. 

15. (Original) The method of claim 1 further comprising a preprocessing step for enhancing the 
quality of the interaction. 

16. (Original) The method of claim 1 further comprising a speech/non-speech segmentation step 
for eliminating non-speech segments from the interaction. 

1 7. (Original) The method of claim 1 wherein the segmentation step comprises scoring the at 
least one segment with a voice model of a known speaker. 

18. (Currently Amended) A speaker segmentation apparatus for associating an at least one 
segment of speech for each of at least two speakers participating in an at least one audio 
interaction, with a side of the interaction, using additional information, the apparatus comprising: 

a segmentation component for associating an at least one segment within the interaction 
with one side of the at least one interaction , the segmentation component comprising . 

a parameterization component for transforming a speech signal into a set of 
feature vectors and dividing the set into non-overlapping segments; 

an anchoring step component for locating an anchoring segment for each of the at 
least two sides of the interaction; and 

a modeling and classification component for associating at least one segment with 
each side of the interaction ; and 
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a scoring component for assigning a score to said segmentation. 

19. (Original) The apparatus of claim 18 wherein the additional information is at least one of the 
group consisting of: computer-telephony-integration information related to the at least one 
interaction; spotted words within the at least one interaction; data related to the at least one 
interaction; data related to a speaker thereof; external data related to the at least one interaction; 
or data related to at least one other interaction performed by a speaker of the at least one 
interaction. 

20. (Currently Amended) A quality management apparatus for interaction-rich speech 
environments, the apparatus comprising: 

a capturing or logging component for capturing or logging an at least one audio 
interaction; 

a segmentation component for segmenting the at least one audio interaction , the 
segmentation component comprising: 

a parameterization component for transforming a speech signal into a set of 
feature vectors and dividing the set into non-overlapping segments; 

an anchoring step component for locating an anchoring segment for each of the at 
least two sides of the interaction; and 

a modeling and classification component for associating at least one segment with 
each side of the interaction ; and 

a playback component for playing an at least one part of the at least one audio interaction. 

21 . (New) The method of claim 1 , wherein the anchoring step comprises: 

selecting a homogeneous segment as a first segment, 
constructing a first model of the homogenous segment, and 

selecting a second anchor segment such that its model is different from the first model. 

22. (New) The method of claim 21, wherein the homogenous segment is selected by spotting a 
predetermined phrase. 
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23. (New) The method of claim 1 8 wherein the anchoring component selects a homogenous 
segment as a first anchor segment, and a second anchor segment having a statistical model 
different from a statistical model associated with the first anchor segment. 
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